CVL OCR DB, an Annotated Image Database of Text in Natural Scenes, and Its Usability

Authors

  • Andrej Ikica
  • Peter Peer
Abstract

Text detection and optical character recognition (OCR) in images of natural scenes form a fairly new area of computer vision, yet one that is very useful in numerous application areas. Although many implementations achieve promising results, they are evaluated mostly on private image collections that are very hard or even impossible to obtain, which makes it very difficult to compare them objectively. Since our aim is to help the research community standardize the evaluation of text detection and OCR methods, we present CVL OCR DB, a public database of annotated images of text in diverse natural scenes, captured under varying weather and lighting conditions. All the images in the database are annotated with text region and single character location information, making CVL OCR DB suitable for testing and evaluating both text detection and OCR methods. Moreover, all the single characters are also cropped from the original images and stored individually, turning our database into a huge collection of characters suitable for training and testing OCR classifiers.

Slovenian title: Anotirana podatkovna baza slik teksta v naravnih scenah CVL OCR DB in njena uporaba

Keywords (translated from Slovenian): radio-frequency signal transmission over optical fiber, remote antenna unit, cellular architecture, fiber access, fixed and mobile convergence


Similar resources

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is one of the challenging problems due to variations in font, size, color and orientation of text. In this paper, we introduce a new concept so-called Edge Color Signature for localizing text regions in an image. This method is able to localize both Farsi and English texts. In the proposed method first a pyramid using diff...

NEOCR: A Configurable Dataset for Natural Image Text Recognition

Recently, growing attention has been paid to recognizing text in natural images. Natural image text OCR is far more complex than OCR in scanned documents. Text in real-world environments appears in arbitrary colors, font sizes and font types, and is often affected by perspective distortion, lighting effects, textures or occlusion. Currently there are no datasets publicly available which cover all aspec...

Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)

Document images produced by a scanner or digital camera usually suffer from geometric and photometric distortions. Both degrade the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions, aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...

Towards Text Recognition in Natural Scene Images

In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signal-dependent noise. Conn...

A Morphological Image Preprocessing Suite for OCR on Natural Scene Images

As demand grows for mobile applications, research in optical character recognition (OCR), a technology well-developed for document imaging, is shifting focus to the recognition of text embedded in digital photographs or video. Segmenting text and background in natural scenes is a difficult classification problem, and the accuracy of this segmentation is of utmost importance when the output of a...


Publication date: 2011